README File for "Direct Arabic products' opinions dataset for opinion mining and sentiment analysis" 
==================================================================================================================
The products' opinions in Arabsentiment dataset is collected manually from different social products' resources for opinion mining, feature extraction and sentiment analysis tasks. The collected opinions included different types of direct opinions that include at least one product feature whether it stated explicitly or in implicit manner. 

The dataset contains twenty different products categories like home, baby, different types of software products and other product types. 
Additionally, the products� features are identified manually from the customer opinions and the product description. 
The products are classified according to each product type and there is a specific search query related to each type. 
For each product, the product name and brief description about the product capabilities are registered in products information file and classified to specific product types with a specific initial query for each type. 

The collected data contains opinions about twenty different products' categories. 
These opinions are selected based on the text size and the number of features that appear in the opinionated text. 
For each opinion, we keep track of the opinionated text and the sentiment rating score entered by the customers. The rating score represent the overall polarity of the reviewer towards the products into one of two categories: positive or negative sentiment.
The main dataset attributes involve the total number of directed opinions used in dataset that should include at least one explicit product features, the number of opinions with positive sentiment score is 1459 and negative sentiment polarity score is 516.




